Abstract: Pig welfare is closely related to the economic performance of pig farms. In pig welfare
assessment, pig sounds are significant indicators that can reflect the quality of the barn environment
and the physical condition and health of pigs. Pig sound analysis is therefore a high priority. In this
review, the relationship between pig sound and welfare was analyzed. Three kinds of pig sounds are
closely related to pig welfare: coughs, screams, and grunts. Subsequently, the advantages and
disadvantages of both wearable and non-contact sensors were briefly described. Given the advantages
and feasibility of contactless microphone sensors, the existing techniques for processing pig sounds
were elaborated and evaluated from three aspects: sound recording and labeling, feature extraction,
and sound classification. Finally, the challenges and opportunities of pig sound research were
discussed with a view to precision livestock farming (PLF) from four perspectives: sound monitoring
technologies, individual pig welfare monitoring, commercial applications, and pig farmers. In summary,
most current research on pig sound recognition focuses on classifier selection and algorithm
improvement, while less research has been conducted on sound labeling and feature extraction.
Meanwhile, pig sound recognition faces challenging problems, including the difficulty of obtaining
audio data from different pig growth stages and of verifying the developed algorithms on a variety of
pig farms. Overall, it is suggested that the technologies involved in the automatic identification
process be explored in depth. In the future, strengthening cooperation among cross-disciplinary
experts to promote the development and application of PLF is also necessary.
than half of the total output of pork, beef, mutton, and poultry. The huge demand for pork has
led to a growing trend toward modern pig production. Production intensification and specialization
are two typical characteristics of modern pig farming, which contribute to increased productivity
of pigs and thus to the economic efficiency of production. At the same time, animal welfare is
becoming a concern as farming modes in pig herds are transformed.

In general, animal welfare includes three parts, i.e., natural living, affective states, and basic
health and functioning, in different behaviors and various conditions. Monitoring animal body
conditions is beneficial for both animals and farmers. First, animal health could be improved,
which decreases the usage of veterinary drugs and reduces mortality. Meanwhile, better animal
health contributes to better animal emotion. Second, lower veterinary bills lead to direct
financial benefits for farmers and improved pork quality. Most importantly, animal health may
directly affect human health: monitoring the physical condition of animals can reduce the risk of
zoonoses. It is becoming even more crucial to investigate the relationships between good welfare,
good health, and disease resistance, with enormous commercial and social benefits in setting
higher standards.

Precision livestock farming (PLF) aims to assist farmers in making appropriate management
decisions and avoiding certain risks by using real-time monitoring technologies. Some reviews
related to pig welfare and relevant technologies have been reported recently. Mahfuz et al.
provided a general overview of, and guidance on, smart tools and applications in modern pig
farming; both non-invasive and invasive methods were discussed. Tzanidakis et al. summarized three
main categories of non-intrusive technologies, including camera-based, microphone, and
communication information technology (CIT) sensors, and attempted to predict potential
technological developments. Schillings et al. affirmed the impact of sensor applications on animal
welfare from the two aspects of health and emotion; both the benefits and the underlying risks of
PLF were explored and discussed. Furthermore, Racewicz et al. analyzed different technologies to
achieve effective monitoring of pig health. They emphasized that pig health and welfare measures
should be integrated with the data obtained to establish reliable monitoring systems for pig
production assessment. As a diverse range of smart sensors emerged from technology development,
their commercialization potential and contributions to welfare were also investigated and
evaluated.

The above reviews briefly described the relevant technologies applied in pig farms, explicitly
demonstrating the potential and value of PLF technology. Moreover, it is worth remarking that
previous studies have acknowledged sound as a potentially useful indicator for inferring animal
welfare. Sound analysis techniques and sensors have developed rapidly in recent years, providing
opportunities to monitor pigs automatically and improve their welfare. The main purpose of this
paper is to examine the relationship between pig sounds and welfare based on microphone
monitoring. The existing pig sound processing technologies were summarized and discussed in
detail. Shortcomings and outlooks of the related technologies were also elaborated. Finally, the
challenges and perspectives of sound monitoring with PLF in modern pig farms were proposed.


2 Pig sound and welfare

Sound carries emotional, physiological and individual information [11,12]. It could be considered
a potentially valuable indicator for discerning animal welfare. Table 1 lists different welfare
indicators related to pig sounds. It can be seen that studies on pig sounds were mainly devoted to
coughs, screams, and grunts. In these articles, cough sounds reflect air quality in pigsties and
serve as crucial indicators for health monitoring. Screams and grunts reflect reactions to
physical conditions, and monitoring them helps improve pig welfare.

2.1 Pig sound and environment

In an intensive pig farm, air quality and heat stress are two influential factors associated with
the living environment, directly affecting pig welfare and product quality. Ferrari et al.
assessed heat stress by analyzing continuous pig screams and grunts. A peak frequency higher than
750 Hz for both sounds was considered an indicator of heat stress. Amaral et al. demonstrated the
relationship between the sound pressure levels (SPL) produced by piglets and the thermal
environment of the pig nursery. A range of 56.3~60.3 dB was regarded as a good indicator for
assessing thermal comfort. Meanwhile, cough sound was regarded as an objective and non-invasive
biomarker of the respiratory state in studies of exposure to air pollutants. The results also
indicated that cough sound analysis could provide valuable and qualitative information about air
quality conditions in commercial livestock farming, with an average peak frequency of pig coughs
of 1737 Hz (472 Hz higher than the control group) under better indoor air quality. Pessoa et al.
assessed a baseline of cough in pigs free of respiratory disease and investigated the relationship
between environmental conditions and cough frequency. They found a positive correlation between
changes in ammonia (NH3) concentration and continuous coughing of pigs.
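As an illustration of how a peak-frequency indicator of this kind could be computed, the following is a minimal sketch using NumPy. It is not the procedure of the cited studies: the Hann window, the function names `peak_frequency` and `suggests_heat_stress`, and the synthetic test tones are assumptions made for the example; only the 750 Hz threshold comes from the text above.

```python
import numpy as np

HEAT_STRESS_PEAK_HZ = 750.0  # threshold quoted above for screams and grunts


def peak_frequency(signal, sample_rate):
    """Return the frequency (Hz) of the strongest non-DC spectral component."""
    signal = np.asarray(signal, dtype=float)
    windowed = signal * np.hanning(len(signal))      # reduce spectral leakage
    magnitude = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    magnitude[0] = 0.0                               # ignore the DC offset
    return float(freqs[np.argmax(magnitude)])


def suggests_heat_stress(signal, sample_rate):
    """Hypothetical helper: compare the peak frequency with the 750 Hz threshold."""
    return peak_frequency(signal, sample_rate) > HEAT_STRESS_PEAK_HZ


# Synthetic 1 s tones stand in for real recordings (illustration only).
sample_rate = 44100
t = np.arange(sample_rate) / sample_rate
high_tone = np.sin(2 * np.pi * 900.0 * t)
low_tone = np.sin(2 * np.pi * 500.0 * t)
```

On real barn recordings the spectrum would first need noise reduction and segmentation into individual sounds, so a single-threshold rule like this should be read as a starting point only.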


2.2 Pig sound and physical condition

Sound analysis has been applied to evaluate pig physical conditions such as body temperature
changes, pain, hunger and thirst. Moi et al. identified differences in swine vocalization patterns
under different stress conditions: thirst (no access to water), hunger (no access to food), and
heat stress. Pigs were found to be thirsty when sound intensity ranged from 73.87 dB to 80.18 dB.
A value higher than 80.18 dB indicated that the pigs were hungry or under heat stress. To further
confirm the pig's status, pitch frequency was used: 212.87~276.71 Hz indicated hunger and values
higher than 276.71 Hz indicated heat stress. Abnormal conditions were also detected by analyzing
grunts, frightened screams and feeding howls, with a total sound recognition rate of 95.5%.
Besides, vocalization is a valuable tool for identifying stress in pigs during castration. Without
local anaesthesia, piglets uttered almost twice as many screams during the experiment. Scream
characteristics are also significantly different from those of grunts. Moreover, different
acoustic parameters were useful for evaluating the level of pain in piglet management. The results
showed that the values of pitch, intensity, and maximum amplitude increased from the normal state
to castration. Based on research on scream characteristics, representative features were used to
define pig screams for constructing a more accurate classifier. A pig sound was defined as a
scream when its duration was longer than 0.4 s. A simple voting system was constructed to classify
the screams with a precision of 83%. With respect to specific emotion analysis of pigs, Riley et
al. and Moura et al. proved that phonations increased with fear and dis-
Table 1 Welfare indicators related to pig sounds

Sounds                  Indicators           Welfare             Production phase                            Year
Coughs                  Air quality          Environment         Weaners                                     2019
Coughs                  Air quality          Environment         Weaners                                     2004
Coughs, sneezes         Air quality          Environment         Fattening                                   2022
Coughs                  Respiratory disease  Health              Fattening                                   2001
Coughs                  Respiratory disease  Health              Fattening                                   2008
Coughs                  Respiratory disease  Health              Fattening                                   2008
Coughs                  Respiratory disease  Health              Fattening                                   2008
Coughs                  Respiratory disease  Health              Fattening                                   2008
Coughs                  Respiratory disease  Health              Fattening                                   2008
Coughs                  Respiratory disease  Health              Fattening                                   2008
Coughs                  Wasting disease      Health              Fattening                                   2010
Coughs                  Respiratory disease  Health              Fattening                                   2013
Coughs                  Wasting disease      Health              Weaners                                     2013
Coughs                  Wasting disease      Health              Weaners                                     2016
Coughs                  Respiratory disease  Health              /                                           2020
Coughs                  Wasting disease      Health              /                                           2020
Coughs                  Respiratory disease  Health              Fattening                                   2021
Coughs                  Respiratory disease  Health              Fattening                                   2021
Coughs, screams         Respiratory disease  Health              Farrowing                                   2016
Coughs, screams         Respiratory disease  Health              Farrowing                                   2020
Coughs                  Respiratory disease  Health              /                                           2017
Coughs                  Respiratory disease  Health              /                                           2019
Coughs                  Respiratory disease  Health              /                                           2020
Coughs                  Respiratory disease  Health              /                                           2020
Coughs                  Respiratory disease  Health              Fattening                                   2019
Coughs                  Respiratory disease  Health              Fattening                                   2019
Screams                 Stress               Health              Piglets                                     2009
Screams                 Stress               Physical condition  Piglets                                     2008
Screams                 Stress               Physical condition  Piglets                                     2009
Screams                 Stress               Physical condition  /                                           2012
Screams, grunts         Stress               Environment         Piglets                                     2013
Screams                 Stress               Environment         Piglets                                     2020
Screams                 Stress               Physical condition  /                                           2014
Screams, grunts         Stress               Physical condition  Piglets                                     2003
Screams                 Stress               Physical condition  Fattening                                   2015
Screams                 Distress             Physical condition  Farrowing, nursery, growing, and finishing  2018
Screams                 Distress             Physical condition  Piglets                                     2016
Screams                 Stress               Physical condition  Piglets                                     2018
Grunts, screams, howls  Distress             Physical condition  Farrowing                                   2020
Coughs, screams, howls  Distress             Physical condition  /                                           2017

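The intensity and pitch thresholds reported in Section 2.2 for thirst, hunger and heat stress can be read as a small decision rule. The sketch below encodes them in Python purely as an illustration; the function name `classify_stress`, the handling of the boundary values, and the labels returned for ranges the text does not cover are assumptions, not part of the cited work.

```python
def classify_stress(intensity_db, pitch_hz):
    """Map sound intensity (dB) and pitch (Hz) to a stress label.

    Thresholds are the ones quoted in Section 2.2; everything outside the
    quoted ranges is reported as undetermined rather than guessed.
    """
    if intensity_db < 73.87:
        return "unspecified"            # below the quoted thirst range
    if intensity_db <= 80.18:
        return "thirst"                 # 73.87 dB to 80.18 dB
    # Above 80.18 dB the text says hunger or heat stress; pitch separates them.
    if pitch_hz > 276.71:
        return "heat stress"
    if pitch_hz >= 212.87:
        return "hunger"
    return "hunger or heat stress"      # pitch below the quoted hunger range


label = classify_stress(intensity_db=85.0, pitch_hz=250.0)
```

A rule of this form mirrors the decision-tree classifiers mentioned later in the review, but thresholds derived from one experiment would need re-validation before being applied on a different farm.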
2.3 Pig sound and health

Both screams and coughs could be considered direct indicators for monitoring pig health. For
instance, significant differences were found between the screams of healthy piglets and those of
sick ones (affected by traumatic arthritis). Although the artificial neural network was prone to
errors in discriminating between healthy and sick pigs, the study proved the feasibility of using
screams. In existing studies, cough analysis is predominant among all kinds of pig sounds,
especially for wasting diseases and respiratory diseases.

Early efforts on pig cough detection in pig herds were undertaken under laboratory conditions in
a series of studies by Katholieke Universiteit Leuven in Belgium. As demonstrated in these
studies, the differences revealed by sound analysis confirmed that sound parameters vary with the
health status or disease of the animal. Due to lesions of the respiratory system, infectious cough
sounds differed from non-infectious cough sounds. It was also proved feasible to classify pig
coughs among various sounds in a series of trials. The experiments were then transferred to a
commercial pig farm for further testing. As expected, the accuracy of cough detection decreased in
a complex commercial farm compared with a controlled environment. The main reason was that
background noise was the biggest interference factor in sound detection. Although classification
performance was much lower, cough detection could still be regarded as an indicator of disease in
pig farms. These studies made crucial contributions, since the results targeted different
frequency ranges of pig sounds and served as a basis for the development of more complicated
models.

Building on the large number of basic investigations of pig coughs, Korea University focused on
the study of wasting disease. The experiments were performed on a commercial pig farm and aimed at
classifying different porcine wasting diseases, such as Postweaning Multisystemic Wasting Syndrome
(PMWS), Porcine Reproductive and Respiratory Syndrome (PRRS) virus, and Mycoplasma hyopneumoniae
(MH). The approach was shown to be robust against noise in pig herds. Moreover, a low-cost sound
sensor system was suggested for small and medium-size farms with limited budgets.

Previous studies have contributed significantly to research on pig cough sounds, whereas related
research in China has been conducted mainly in the last decade. In the early stage, sound
recognition of predelivery Meishan sows was attempted under laboratory conditions. A team at
Nanjing Agricultural University transferred the experiment from the laboratory to a pig farm to
collect sounds and conduct an in-depth study on the sounds of Meishan sows. Meanwhile, Taiyuan
University of Technology carried out a series of machine learning experiments on pig cough
characteristics, localization, recognition algorithms, and multi-sensor co-monitoring. Following
the ongoing development of technology, Huazhong Agricultural University applied deep learning
methods to pig cough recognition and achieved satisfying results. Besides the research teams
mentioned above, many researchers in China are focusing on improving the performance of cough
sound recognition by adopting and fine-tuning various algorithms. Breakthroughs in technologies,
including machine learning and deep learning, are boosting the development of PLF.
2.4 Summary

In summary, as an indicator of animal welfare, pig sound has been well tested and some findings
have been produced. However, it is evident that current research on pig sounds concentrates mainly
on coughs, screams and grunts. The meanings of other pig sounds and their relationship with
welfare need further study. Meanwhile, the majority of previous studies targeted one particular
aspect of pig welfare through qualitative or quantitative analysis. In fact, a specific sound can
point to multiple aspects of animal welfare. This dramatically increases soundscape complexity,
especially in commercial pig farms. As a whole, the relationship between pig sound and welfare is
a combination of multiple occurrences rather than changes caused by a single factor. This has
become a major issue for animal welfare research based on pig sounds, and it deserves significant
study. For example, it may be possible to distinguish different coughing sounds as further
management indicators, i.e., whether a cough is triggered by disease (pig wasting diseases and
respiratory diseases) or by air pollutants in the pigsty. Such refinement would have a profound
effect on the overall improvement of PLF and positive welfare.

3 Sound analysis

Benefiting from the development of sensor technology, animal welfare can be monitored in diverse
ways. In general, monitoring sensors available in pig farms can be divided into two types, namely
invasive sensors and non-invasive sensors. RFID and accelerometers are two commonly used invasive
sensors. The advantage of invasive sensors is that they satisfy the identification and tracking
requirements for individual information. In contrast, their disadvantages are apparent in two
aspects. Attaching a tag causes injury, pain, and stress to pigs, which goes against animal
welfare. Another limitation is that the devices are not easy to maintain. Non-invasive equipment
frequently used in pig farming includes camera-based sensors, microphones, and infrared thermal
cameras. Its advantages are that the equipment is relatively easy to check in time and that it
reduces the pressure on pigs. However, research is still focused on the group monitoring level.
Improving the accuracy of individual monitoring is one of the challenges of non-invasive
equipment.

Among the non-invasive equipment, the microphone has been adopted for pig sound recognition and
welfare assessment owing to its non-invasive and continuous monitoring merits. Statistically, the
number of studies based on microphone technology is the fourth highest among the existing smart
technologies. Welfare monitoring research is in high demand. Meanwhile, microphone technology
shows relatively high potential for commercial implementation due to the low cost of the devices.
It is an important component of PLF, combining technological advancements in the management
process and animal behavior.

In general, the process of sound analysis consists of four steps, namely sound recording [56],
individual sound labeling [57], sound feature extraction, and classification, as shown in Fig. 1.

Fig. 1 Flowchart of sound analysis

3.1 Sound recording and labeling

The first step is the collection of raw data recorded by microphone. Typically, the microphone
was placed around the pen in a pig room. Its position was explored in field conditions in terms of
height and position relative to the walls and to disruptive sound sources such as ventilation. The
sampling rate could also be adjusted; in most cases, 44.1 kHz was used in field conditions.
Another factor to consider is the number of microphones. Limited by experimental conditions, one
microphone was adopted in most existing studies, although the number of microphones inevitably
affects the sound recording quality. Subsequently, the collected recordings need to be
pre-processed to reduce background noise as much as possible for further processing and analysis.
Pre-processing includes filtering, pre-emphasis, framing, and windowing, etc. To evaluate the
performance of classification algorithms, it is necessary to segment and label the continuous
recording data. The segmentation and labeling of individual sounds can be completed automatically
or manually. Currently, most studies on pig sounds relied on manual labeling, and few focused on
the labeling of recordings. The double-threshold endpoint detection method was frequently used to
collect individual sound segments from recordings. Besides, Li et al. adopted the bidirectional
long short-term memory-connectionist temporal classification (BLSTM-CTC) model to complete the
continuous cough sound recognition task, with a total accuracy of 93.77%.
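To make the pre-processing chain more concrete, the following is a minimal NumPy sketch of pre-emphasis, framing and windowing. The filtering step is omitted, and the pre-emphasis coefficient (0.97), the 20 ms frame length, the 10 ms hop and the Hamming window are common speech-processing defaults assumed here for illustration; the reviewed studies may have used other settings. The sketch also assumes the signal is at least one frame long.

```python
import numpy as np


def pre_emphasis(signal, alpha=0.97):
    """Boost high frequencies: y[n] = x[n] - alpha * x[n-1]."""
    signal = np.asarray(signal, dtype=float)
    return np.append(signal[0], signal[1:] - alpha * signal[:-1])


def frame_signal(signal, frame_len, hop_len):
    """Split a 1-D signal into overlapping frames (trailing samples are dropped)."""
    n_frames = 1 + (len(signal) - frame_len) // hop_len
    index = np.arange(frame_len)[None, :] + hop_len * np.arange(n_frames)[:, None]
    return signal[index]


def preprocess(signal, sample_rate, frame_ms=20.0, hop_ms=10.0):
    """Pre-emphasize, frame and window a recording; returns (n_frames, frame_len)."""
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    hop_len = int(round(sample_rate * hop_ms / 1000.0))
    frames = frame_signal(pre_emphasis(signal), frame_len, hop_len)
    return frames * np.hamming(frame_len)   # taper frame edges before any FFT


# One second of a synthetic tone at the 44.1 kHz rate mentioned above.
sample_rate = 44100
recording = np.sin(2 * np.pi * 1000.0 * np.arange(sample_rate) / sample_rate)
frames = preprocess(recording, sample_rate)
```

The resulting frame matrix is the usual input for the short-time features and endpoint detection discussed in this section.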


Clearly, research on the labeling and extraction of pig sound segments is lacking. Most studies
remained at the manual labeling stage. This is partly because such research requires experts in
animal research to provide certain criteria, but there is no unified standard yet. Another reason
is that open-source databases of pig sounds are scarce. Because pigs differ in their growth
characteristics, it is necessary to capture information on breed, age, and weight, as well as the
environment and location where the sounds are produced. All this information would enrich the
diversity and comprehensiveness of pig sound data. A sufficient number of sound samples is needed
to extract valuable information and to eliminate distracting sounds. It is also a key step toward
high accuracy in the subsequent sound analysis.

3.2 Sound feature extraction

After labeling individual sounds from continuous recordings, the crucial step is to extract
valuable information from the sound signals, which is termed audio feature extraction. In existing
studies, various features were extracted from five domains, including time, frequency, cepstral
coefficients, time-frequency, and deep features, as shown in Fig. 2.

Fig. 2 Audio features used in analysis of pig sound
(Features listed in the figure: duration, amplitude, RMS, STE, ZCR; peak, PSD, energy envelope,
spectrum shape based features, tonality based features; MFCCs with first/second-order
coefficients, LPCCs with first-order coefficients; STFT, Mel spectrogram.)
Note: Root mean square (RMS); short-time energy (STE); zero crossing rate (ZCR); power spectral
density (PSD); Mel frequency cepstrum coefficients (MFCCs); linear prediction cepstral
coefficients (LPCCs); short-time Fourier transform (STFT)

Time-domain features are fundamental features that represent signal variation over time. Among
them, duration and amplitude are often chosen to explore the basic information and properties
contained in the sound itself. Duration is a rhythm-based feature, which represents a regular
recurrence of patterns over time. It was shown that the average durations of a single infectious
and healthy cough were 0.67 s and 0.43 s, respectively. The maximum amplitude refers to the
maximum amplitude of the sound wave, which was used to estimate the level of pain of piglets. The
results showed that the maximum amplitude increased from the pain-free state to castration, with
values ranging from 0.2683 Pa to 1 Pa. Root mean square (RMS) is an energy-based feature that can
be used to measure sound loudness. The RMS value of non-infectious pig coughs has been shown to be
higher than that of infectious pig coughs. Another energy-based feature, short-time energy (STE),
is commonly combined with the zero crossing rate (ZCR) to detect voice activity. It was found that
the STE of piglet grunts was highly variable in distress.

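The three energy and rate features named above have simple definitions that can be sketched with NumPy. The function names and the synthetic test frame are assumptions for illustration; STE is taken here as the sum of squared samples in a frame and ZCR as the fraction of adjacent sample pairs whose signs differ, which are common conventions but not necessarily the exact ones used in the cited studies.

```python
import numpy as np


def rms(frame):
    """Root mean square amplitude of one frame."""
    frame = np.asarray(frame, dtype=float)
    return float(np.sqrt(np.mean(frame ** 2)))


def short_time_energy(frame):
    """Sum of squared samples in one frame."""
    frame = np.asarray(frame, dtype=float)
    return float(np.sum(frame ** 2))


def zero_crossing_rate(frame):
    """Fraction of consecutive sample pairs that change sign."""
    signs = np.sign(np.asarray(frame, dtype=float))
    signs[signs == 0] = 1                    # treat exact zeros as positive
    return float(np.mean(signs[1:] != signs[:-1]))


# Synthetic frame: 0.1 s of a 100 Hz unit-amplitude tone at 44.1 kHz.
sample_rate = 44100
n = np.arange(4410)
tone = np.sin(2 * np.pi * 100.0 * n / sample_rate)
features = (rms(tone), short_time_energy(tone), zero_crossing_rate(tone))
```

In a double-threshold endpoint detector, frames whose STE exceeds a high threshold mark the core of a sound, and a lower STE or ZCR threshold is then used to extend the segment boundaries; the thresholds themselves have to be tuned to the recording conditions.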
Features extracted from the frequency domain are effective for processing pig sound signals.
Simply put, a peak is an index of the maximum power of a signal. The average peak frequencies of
infected and healthy pig coughs were calculated as 618 Hz and 1603 Hz, which means peak frequency
can be used to effectively distinguish the coughs of infected and healthy pigs. Tonality-based
features, in terms of fundamental frequency and pitch, were utilized to monitor pig vocalization,
especially for detecting whether the pigs were in a normal condition. When the average value of
the formants was lower than 2671.99 Hz and the duration of the signal was shorter than 0.28 s, the
piglets were shown to be in normal condition. Otherwise, pigs could be considered to be in an
abnormal state. Moreover, pitch could help identify sex, age, and distress. The pitch value of
female pigs (218.2 Hz) was higher than that of male pigs (194.5 Hz) when all the pigs were in the
same conditions. Also, pigs in the nursery and growing stages had higher pitch than those in the
finishing stage. Spectrum shape based features, including spectral flux, spectral spread and
spectral centroid, were also used to recognize the sounds of pigs suffering from different
diseases. In addition to the features mentioned above, power spectral density (PSD) and energy
envelope were two other common representative features in the frequency domain, which were often
used to distinguish pig coughs from non-cough sounds.
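Two of the spectrum shape features mentioned above, spectral centroid and spectral spread, can be sketched as magnitude-weighted statistics of the spectrum. The code below is an illustrative NumPy version under the assumption of magnitude (rather than power) weighting and a Hann window; the synthetic tones at 618 Hz and 1603 Hz merely reuse the peak frequencies quoted in the text and are not real cough recordings.

```python
import numpy as np


def spectral_centroid_and_spread(signal, sample_rate):
    """Return (centroid, spread) in Hz, weighted by spectral magnitude."""
    signal = np.asarray(signal, dtype=float)
    magnitude = np.abs(np.fft.rfft(signal * np.hanning(len(signal))))
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    total = magnitude.sum()
    if total == 0.0:                       # silent input: no meaningful centroid
        return 0.0, 0.0
    centroid = (freqs * magnitude).sum() / total
    spread = np.sqrt((((freqs - centroid) ** 2) * magnitude).sum() / total)
    return float(centroid), float(spread)


# Synthetic 1 s signals (illustration only).
sample_rate = 44100
t = np.arange(sample_rate) / sample_rate
single_tone = np.sin(2 * np.pi * 1603.0 * t)
two_tones = np.sin(2 * np.pi * 618.0 * t) + np.sin(2 * np.pi * 1603.0 * t)

centroid_single, spread_single = spectral_centroid_and_spread(single_tone, sample_rate)
centroid_mixed, spread_mixed = spectral_centroid_and_spread(two_tones, sample_rate)
```

A narrow-band sound yields a small spread around its centroid, whereas a mixture of components pulls the centroid between them and enlarges the spread, which is why these features can help separate sound classes with different spectral shapes.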

The cepstrum is obtained by applying an inverse Fourier transform to the logarithm of the signal
spectrum. Developed by Davis and Mermelstein, Mel frequency cepstrum coefficients (MFCCs) are
commonly utilized in human speech and animal sound recognition. Both the original coefficients and
their first-order or second-order difference coefficients are combined as acoustic features in the
process of feature extraction. For instance, the first 20 coefficients were extracted as a whole
feature vector for discriminating infectious coughs in pigs. To reflect both static and dynamic
characteristics, 12-dimensional original and 12-dimensional first-order delta coefficients were
calculated from each cough sound sample. Furthermore, 39-dimensional MFCCs, combining
13-dimensional MFCCs with their first-order and second-order differential coefficients, were
obtained for continuous pig cough sound recognition. In addition, linear prediction cepstral
coefficients (LPCCs) and their first-order differences were also utilized in detecting the
abnormal status of dry and wet cough sounds.
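The two ideas in this paragraph, the cepstrum itself and the stacking of static and differential coefficients, can be sketched as follows. This is a simplified illustration, not an MFCC implementation: a full MFCC pipeline additionally applies a Mel filterbank and a discrete cosine transform, and delta coefficients are often computed by regression over several frames, whereas `np.gradient` is used here as a simple finite-difference stand-in. The function names and the placeholder coefficient matrix are assumptions.

```python
import numpy as np


def real_cepstrum(frame, eps=1e-10):
    """Inverse FFT of the log magnitude spectrum of one frame."""
    frame = np.asarray(frame, dtype=float)
    log_spectrum = np.log(np.abs(np.fft.rfft(frame)) + eps)  # eps avoids log(0)
    return np.fft.irfft(log_spectrum, n=len(frame))


def add_deltas(static):
    """Stack static, first-order and second-order differences along axis 0.

    static has shape (n_coeffs, n_frames); the result has shape
    (3 * n_coeffs, n_frames), e.g. 13 coefficients become 39 dimensions.
    """
    static = np.asarray(static, dtype=float)
    delta = np.gradient(static, axis=1)      # finite difference over time
    delta2 = np.gradient(delta, axis=1)
    return np.vstack([static, delta, delta2])


# Placeholder matrix standing in for 13 MFCCs over 50 frames (not real data).
placeholder_mfcc = np.random.default_rng(0).standard_normal((13, 50))
stacked = add_deltas(placeholder_mfcc)
```

The static rows describe the spectral envelope of each frame, while the difference rows describe how that envelope changes between neighbouring frames.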

For time-frequency features, one-dimensional audio signals are frequently transformed into
two-dimensional time-frequency representations. Among them, short-time Fourier transform (STFT)
and Mel-STFT spectrograms are frequently used in pig sound recognition in two ways. In the first,
spectrograms are combined with deep learning models, so that hand-crafted feature extraction is
not required. For instance, STFT spectrograms were applied to the AlexNet and MnasNet deep
architectures, respectively, while Mel spectrograms were fed to convolutional neural networks
(CNN) with a convolutional block attention module for recognizing abnormal pig sounds. In the
second, deep features are extracted by deep learning models that are regarded as feature
extractors. Lee et al. extracted deep features from a 6-layer CNN and fed them into a multi-layer
perceptron for pig wasting disease classification. In Shen et al., an MFCC-CNN feature was
extracted from a one-layer CNN and fed into a support vector machine (SVM) classifier for pig
cough recognition.
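The transformation from a one-dimensional signal to a two-dimensional STFT spectrogram can be sketched with NumPy alone. The frame length of 1024 samples, the hop of 512 samples, the Hann window and the decibel scaling are assumptions chosen for the example; the reviewed studies do not necessarily use these settings, and a Mel spectrogram would additionally apply a Mel filterbank to each column.

```python
import numpy as np


def stft_spectrogram_db(signal, frame_len=1024, hop_len=512):
    """Return a log-magnitude spectrogram of shape (freq_bins, time_frames)."""
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + (len(signal) - frame_len) // hop_len
    index = np.arange(frame_len)[None, :] + hop_len * np.arange(n_frames)[:, None]
    frames = signal[index] * np.hanning(frame_len)     # windowed frames
    magnitude = np.abs(np.fft.rfft(frames, axis=1))    # one spectrum per frame
    return 20.0 * np.log10(magnitude + 1e-10).T        # dB scale, time on axis 1


# Synthetic 1 s, 2 kHz tone at 44.1 kHz (illustration only).
sample_rate = 44100
t = np.arange(sample_rate) / sample_rate
spectrogram = stft_spectrogram_db(np.sin(2 * np.pi * 2000.0 * t))
```

The resulting matrix can be treated as a single-channel image, which is what allows image-oriented CNN architectures to be reused for sound recognition.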

Up to now, most studies have targeted the time or frequency domain features of pig sounds. Fewer
studies have been conducted on time-frequency domain and deep features. Besides, other features
could be considered in pig sound analysis. For instance, harmonicity, which is utilized to
distinguish tonal sounds from noise, has been used in bird sound classification. Spectrum shape
based features, including spectral centroid, spectral roll-off, spectral flatness, and spectral
bandwidth, have been used in music and animal sound classification. Moreover, other
time-frequency representations could be investigated as time-frequency features and deep
features, such as MFCC, mel-scaled spectrograms, and constant-Q transform (CQT).

3.3 Sound classification

In previous research, the mathematical models used for analyzing and classifying pig sounds can
be divided into three main categories: statistical analysis, machine learning and deep learning.

Statistical analysis was used to complete fundamental research on pig sound characteristics. For
instance, one-way analysis of variance (ANOVA) was one of the most frequently used statistical
analyses. It was shown that healthy coughs had much higher peak frequencies (750~1800 Hz) than
infectious coughs (200~1100 Hz). Also, a significant difference (P<0.001) was observed between
non-infectious coughs (mean duration of 0.43 s) and infectious coughs (mean duration from 0.53 s
to 0.67 s). Thus, single cough duration could be regarded as an indicator for classifying
different kinds of cough sounds. Subsequently, ANOVA was further used to distinguish pig wasting
diseases. The results indicated no differences in cough duration between normal coughs and coughs
with diseases. In addition, there was a significant difference between porcine circovirus type 2
(PCV2) coughs and other coughs (normal, porcine reproductive and respiratory syndrome (PRRS) and
Mycoplasma hyopneumoniae (MH) cough sounds) in pitch, intensity, and formants 1, 2, 3, and 4. Not
only cough sounds but also grunts and screams were analyzed statistically to assess heat stress
and evaluate the level of pain on pig farms. The results showed differences in pig grunts and
screams, which was beneficial for managing pig production in a welfare-friendly way.
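For readers unfamiliar with one-way ANOVA, the test statistic is the ratio of between-group to within-group variance. The sketch below computes this F statistic with NumPy; it does not compute the P value, which requires the F distribution (available, for example, in statistical packages). The function name and the synthetic duration samples are assumptions: the group means of 0.43 s and 0.67 s are borrowed from the text, but the data themselves are randomly generated and are not measurements.

```python
import numpy as np


def one_way_anova_f(*groups):
    """Return the one-way ANOVA F statistic for two or more groups."""
    groups = [np.asarray(g, dtype=float) for g in groups]
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = np.concatenate(groups).mean()
    # Between-group sum of squares: how far group means lie from the grand mean.
    ss_between = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: scatter of samples around their own mean.
    ss_within = sum(((g - g.mean()) ** 2).sum() for g in groups)
    return float((ss_between / (k - 1)) / (ss_within / (n_total - k)))


# Synthetic cough durations in seconds (randomly generated, illustration only).
rng = np.random.default_rng(0)
non_infectious = rng.normal(loc=0.43, scale=0.05, size=30)
infectious = rng.normal(loc=0.67, scale=0.05, size=30)
f_value = one_way_anova_f(non_infectious, infectious)
```

A large F value indicates that the group means differ by more than the within-group scatter would explain, which is the kind of evidence behind the duration differences reported above.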

Machine learning has demonstrated superior performance in many fields. Fuzzy c-means clustering
was used to form two clusters, cough and non-cough sounds, including in a laboratory installation
with nebulization of citric acid and pathologic disease, and under an aerial pollutant control
design with ammonia, dust, and temperature. The overall recognition performance for the identified
sounds (chemically induced coughs, sick coughs, and other sounds) reached 85.5%. Moreover, average
cough classification performance reached 94% in different experimental designs with various aerial
pollutants. SVM is suitable for both linear and nonlinear classification and has been used for a
variety of pig sound tasks, especially pig cough recognition. It was shown that the average
detection accuracy for wasting disease approached 98.4%. Subsequently, Wang et al. reported an
average recognition rate of 95% for cough sounds in different air qualities. Shen et al. achieved
a cough recognition rate of 97.72%. Decision trees were built for classifying diverse conditions,
such as thirst, hunger, and thermal stress; cold and pain; and distress conditions.
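As an illustration of the clustering approach mentioned at the start of this paragraph, the following is a minimal NumPy sketch of fuzzy c-means with two clusters. It is a generic textbook version, not the implementation used in the cited studies: the fuzzifier m = 2, the fixed number of iterations, the farthest-point initialization and the toy two-dimensional feature vectors (which could stand for, say, duration and peak frequency after scaling) are all assumptions.

```python
import numpy as np


def _memberships(x, centers, m):
    """Membership of each sample in each cluster; rows sum to 1."""
    dist = np.linalg.norm(x[:, None, :] - centers[None, :, :], axis=2) + 1e-12
    inv = dist ** (-2.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)


def fuzzy_c_means(x, n_clusters=2, m=2.0, n_iter=100):
    """Return (centers, memberships) for samples x of shape (n_samples, n_features)."""
    x = np.asarray(x, dtype=float)
    # Deterministic initialization: first sample, then the farthest remaining ones.
    centers = [x[0]]
    for _ in range(1, n_clusters):
        chosen = np.array(centers)
        d = np.linalg.norm(x[:, None, :] - chosen[None, :, :], axis=2).min(axis=1)
        centers.append(x[np.argmax(d)])
    centers = np.array(centers, dtype=float)
    for _ in range(n_iter):
        u = _memberships(x, centers, m)
        weights = u ** m
        centers = (weights.T @ x) / weights.sum(axis=0)[:, None]
    return centers, _memberships(x, centers, m)


# Toy feature vectors forming two well-separated groups (hypothetical values).
samples = np.array([[0, 0], [0, 1], [1, 0], [1, 1],
                    [10, 10], [10, 11], [11, 10], [11, 11]], dtype=float)
centers, memberships = fuzzy_c_means(samples)
labels = memberships.argmax(axis=1)     # hard labels, e.g. cough vs non-cough
```

Unlike hard clustering, each sound receives a degree of membership in both clusters, so ambiguous sounds can be flagged for manual review instead of being forced into one class.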

Deep learning has become a popular tool in recent
years, owing to its strong ability in pattern
recognition. Some deep learning models have been
fine-tuned for pig sound recognition, mainly CNNs
and recurrent neural networks (RNNs). For CNN
models, Yin et al. fine-tuned the AlexNet model
to recognize pig coughs, with an accuracy of
96.8%. Although CNNs proved to be effective in
recognizing spectrograms, they inevitably
generate redundant information during the
process. For this reason, an attention mechanism
named the convolutional block attention module
(CBAM) was introduced to optimize the CNN. The
study reported a satisfactory recognition rate of
94.46% for abnormal pig sounds. Since deep neural
networks require greater computational capacity
and more capable hardware, these requirements are
one of the factors limiting the transfer of pig
sound research into practical applications. To
address this issue, researchers introduced
lightweight models to pig sound classification. A
lightweight model based on MnasNet and
MobileNetV2 was used to classify the sounds of
pigs with different diseases, and obtained an
F1-score of 94.7% and a total accuracy of 97.3%.
For RNN models, not only the basic RNN but also
its variants, including long short-term memory
(LSTM), BLSTM, CTC, and gated recurrent unit
(GRU), were applied to pig cough recognition. The
results showed that RNNs are feasible and stable
models for the classification task.
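
The accuracy and F1-score figures quoted above can be reproduced from a list of true and predicted labels. The sketch below is a simplified two-class version (one positive class against everything else); the label names and the function name `binary_metrics` are assumptions for illustration, and the cited studies may have averaged scores over several classes.

```python
def binary_metrics(y_true, y_pred, positive="cough"):
    """Return accuracy, precision, recall and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Hypothetical labels for eight sound clips.
    truth = ["cough"] * 3 + ["other"] * 5
    predicted = ["cough", "cough", "other", "cough",
                 "other", "other", "other", "other"]
    print(binary_metrics(truth, predicted))
```

Reporting F1 alongside accuracy matters when coughs are rare, because a classifier that labels every clip as "other" can still reach a high accuracy.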

3.4 Summary


Over the years, many bioacoustic models have been
developed to analyze and recognize pig sounds.
Statistical analysis is the scientific method of
collecting, exploring, and presenting large
amounts of data to discover underlying patterns
and trends, and it is a classical method for
quantifying variations among variables. A
limitation of empirical methods is that their
applicability is often constrained to the
conditions of the experimental testing. When
conditions exceed the scope of the investigation,
they may not be applicable. For instance, one-way
ANOVA requires that the dependent variable is
normally distributed in each group and that the
within-group variability is similar across
groups. In practical sound classification
problems, however, it is preferable to avoid
assumptions about the distribution of the data
and to use the data directly for prediction. For
comparison, machine learning (ML) provides
complementary data modeling techniques and has
become a more desirable approach to handling
complex data sets. An advantage of ML is its
flexibility, since a model contains a number of
adjustable parameters. However, this also
introduces a certain complexity in selecting the
parameters that fit the model best. Meanwhile,
the predictive results depend on the selected
features, and the best feature set is in turn
sensitive to the choice of ML algorithm. In this
respect, deep learning models show great
potential for automatically extracting abundant
features from raw data. However, deep learning
relies on training and tuning the model with a
large amount of data, leading to more complex
simulation and computation. Therefore,
conventional machine learning is still a useful
and actively researched approach. The existing
literature shows that the selection of a
classifier is still subject to a certain degree
of arbitrariness. The most suitable classifier
should be further validated on running time to
achieve a trade-off between accuracy and
processing speed.
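
To make the one-way ANOVA mentioned in this summary concrete, the sketch below computes the F statistic from scratch, together with the per-group sample variances that can be compared informally against the equal-variability assumption. The grouping (for example, a sound feature measured under three barn conditions) and the numbers are invented for illustration; obtaining a p-value would additionally need the F distribution, which is outside this sketch.

```python
def sample_variance(values):
    """Unbiased sample variance (divides by n - 1)."""
    mean = sum(values) / len(values)
    return sum((x - mean) ** 2 for x in values) / (len(values) - 1)


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of groups of numbers."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]

    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, means))
    # Variation of the observations around their own group mean.
    ss_within = sum((x - m) ** 2
                    for g, m in zip(groups, means) for x in g)

    df_between = k - 1
    df_within = n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


if __name__ == "__main__":
    # Hypothetical feature values under three conditions.
    groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [5.0, 6.0, 7.0]]
    print("variances:", [sample_variance(g) for g in groups])
    print("F, df_between, df_within:", one_way_anova_f(groups))
```

A large F value indicates that the group means differ more than the within-group scatter would suggest, but the conclusion is only as reliable as the normality and equal-variance assumptions described above.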


4 Challenges and perspectives 


4.1 Sound monitoring 


Table 1 shows that pig sound analysis commonly
targeted a specific production phase.
Specifically, fattening pigs account for the
highest percentage, followed by sows and piglets,
which is consistent with Gómez et al. Measurement
and further validation across pig production
stages are necessary and still lacking. On one
hand, diverse pig sounds are present at all
growth stages. In other words, monitoring typical
sounds, such as coughs and screams, is required
at each growth stage. Therefore, sound monitoring
can be enhanced by expanding the range of stages
studied. On the other hand, due to the outbreak
of African swine fever virus (ASFV), stricter
management is enforced in large commercial pig
farms and researchers' access is temporarily
curtailed. As a result, experiments in pig farms
have been intermittent. This situation will
improve as the epidemic eases. Meanwhile,
experiments can be continued in small and
medium-sized pig farms in the future.

In addition, pig sound localization is an
important topic to be investigated in the future.
On one hand, it locates pigs with health
problems, which helps improve the management of
pig herds. On the other hand, the relevant
studies are few and the field is still at an
early exploratory stage. Currently, the time
difference of arrival (TDOA) between different
microphones has been applied in pig cough
localization. Although the current positioning
results have proved feasible in pig houses (mean
error less than 1 m), a challenging issue has
also surfaced, namely the trade-off between the
number of microphones and their cost. These
results are instructive for further studies
during the experimental phase, and the number of
microphones tested is acceptable. However, in a
commercial context, each additional microphone
undoubtedly adds cost. This also motivates
researchers to deepen cough positioning research
and continuously optimize the experiments. The
aim is to achieve an optimal balance between the
number of microphones, positioning accuracy, and
cost. Another problem is how to locate and track
sick pigs in real time, since the pig is a moving
target. In addition, it is worthwhile to study
how to solve the problem of multi-target
localization when the number of targets with
abnormal conditions is large. So far, it also
remains relatively difficult to move from
group-level recognition to locating and
recognizing individual pigs. Given that sound
localization studies are still scarce, it is
suggested that pig sound analysis be extended
toward sound localization in the future.
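
The TDOA principle discussed above can be sketched in a few lines. The example assumes a flat rectangular pen, four microphones at known positions, a constant speed of sound, and noise-free arrival times; the grid search, the pen dimensions, and the function names are illustrative choices, not the method of the cited studies.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s in air at roughly 20 degrees C (assumed constant)


def tdoa_cost(candidate, mics, arrival_times):
    """Sum of squared differences between measured and predicted TDOAs.

    TDOAs are taken relative to the first microphone, so the unknown
    emission time of the cough cancels out.
    """
    d_ref = math.dist(candidate, mics[0])
    cost = 0.0
    for mic, t in zip(mics[1:], arrival_times[1:]):
        predicted = (math.dist(candidate, mic) - d_ref) / SPEED_OF_SOUND
        measured = t - arrival_times[0]
        cost += (predicted - measured) ** 2
    return cost


def locate_by_tdoa(mics, arrival_times, width, length, step=0.1):
    """Grid-search the floor area (0..width, 0..length) for the best fit."""
    best_pos, best_cost = None, float("inf")
    nx = int(round(width / step))
    ny = int(round(length / step))
    for i in range(nx + 1):
        for j in range(ny + 1):
            pos = (i * step, j * step)
            cost = tdoa_cost(pos, mics, arrival_times)
            if cost < best_cost:
                best_pos, best_cost = pos, cost
    return best_pos


if __name__ == "__main__":
    # Hypothetical 10 m x 6 m pen with one microphone in each corner.
    mics = [(0.0, 0.0), (10.0, 0.0), (0.0, 6.0), (10.0, 6.0)]
    cough_at = (3.0, 2.0)
    # Simulated arrival times; 0.5 s stands in for the unknown emission time.
    times = [0.5 + math.dist(cough_at, m) / SPEED_OF_SOUND for m in mics]
    print(locate_by_tdoa(mics, times, width=10.0, length=6.0))
```

In a real barn, reverberation, background noise, and timing errors degrade the estimate, and adding microphones reduces the error at a price, which is the trade-off between microphone count, accuracy, and cost raised in the text.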

In general, current research has been based on a
laboratory or a specific pig farm, with great
differences in size, environment, and individual
pig conditions. Although modern technologies have
been used to analyze the sounds collected from
the barn, the expected results are still not
available at this stage. This is because the
combination of the pig housing environment and
the individual body condition affects the
variability of the pig's vocalizations.
Therefore, the current methods are insufficient
to handle the complex interaction between pigs
and their environment. This has become a major
factor that makes it difficult to popularize PLF
at present. To address this issue, it is
suggested that future work focus on the
interactions between multiple monitoring modules
rather than on individual processes. It may be
possible to find the most appropriate interface
between multiple modules by interpreting multiple
sets of inputs from a variety of biological
responses. Moreover, this could be a better way
to handle the whole PLF process.

Cough-based monitoring technology is still at a
highly technology-dependent stage of development.
No integration with animal welfare has been
considered to date. This could be due to the lack
of a well-developed rating system linking welfare
indicators and pig sounds. Such a system is
essential for application in pig production. To
address this weakness, the participation and
cooperation of researchers from different
disciplines should be strengthened in the future.
The latest study by Silva et al. focused on
generating technical monitoring indicators by
monitoring the frequency of coughs in the pig
house and the corresponding disease diagnosis.
Their preliminary findings indicated that dry and
non-productive coughs signal the presence of
Mycoplasma hyopneumoniae. Although such research
is essential and significant, it has not yet been
carried out in China. It is recommended to refine
the details of experiments in pig farms by
combining sound analysis and pig diseases in the
future.

4.2 Individual welfare


Although microphone sensor-based sound
localization techniques are constantly being
upgraded, they can only narrow the range of sound
monitoring as much as possible. It is still
difficult to pinpoint individual pigs in this
way. However, individual identification of pigs
is necessary. For example, when coughs are
detected in a certain pen, it is important to
identify the coughing pig and separate it from
the herd. Therefore, combining different sensors
is a promising way to promote individual pig
welfare in the future. For instance, unhealthy
pigs could be identified with a camera-based
facial recognition system to overcome the
limitations of sound monitoring. In addition,
computer vision can provide information on
behavioral interactions between individuals,
including the detection of aggressive events and
mood-elevating behaviors. Another promising
approach is the application of remote video
monitoring technology. Individual pigs can be
monitored visually in this way, including their
body condition, lameness, feed intake, and
oestrus.

4.3 Commercial applications


Currently, most research on pig vocalization
monitoring is still at the development stage,
with only a few advanced commercial products such
as SoundTalks and the STREMODO system. SoundTalks
is a cough monitoring system developed by a
Belgian company, which measures cough sounds in
an automated and continuous way. The STREMODO
system, proposed by a German company, records and
assesses stress vocalizations in pig groups. In
addition, Yingzi Technology, established in
China, is a promising new technology company that
strives to create a multifaceted platform for
data-driven agriculture and to facilitate pig
farm development involving livestock management
and biosecurity. The microphone-based method
requires extensive consideration of various
factors, such as the variability and diversity of
the barn environment of pig herds. However, it is
not easy to meet the demand for both accuracy, in
terms of technical reliability, and low cost, in
terms of equipment maintenance.

4.4 Farmer concerns


The goal of PLF will eventually lead researchers
to turn the technology into products. However, in
addition to concentrating on technology, the
concerns of pig practitioners should also be
taken into account. After all, practitioners are
the key to implementing PLF in pig farming.
Farmers' concerns arise in several aspects, such
as internal constraints in pig farms, technical
costs, and technical support. Specifically,
feasibility depends first on whether the hardware
conditions on the farm meet the requirements.
Even with suitable farm conditions, another
concern is whether the emerging technology can
deliver the expected results. In addition, the
cost of application and maintenance is also worth
considering. Moreover, complex new knowledge may
reduce the motivation to learn and prevent
farmers from embracing new technology.

Regarding these concerns, the following
suggestions are proposed for the future. First,
further validation linking PLF techniques with
research into positive welfare indicators should
be encouraged to enhance the confidence and trust
of pig practitioners. Second, equipment should be
designed and optimized from the farmers' point of
view to provide convenience and reduce product
costs. Finally, the awareness of farmers should
be raised by strengthening training and
communication. Although PLF has not yet reached
an industry consensus, it is highly topical.
Moreover, pig practitioners expect PLF to assist
in pig farming.

Contactless sound analysis is a promising way to
assess pig welfare. It is a key element of PLF
and has proved to be a feasible way to promote
PLF. It represents ongoing progress in pig
production and a key technology. Overall,
considering the demand for farm intensification
and better animal welfare, PLF could become a
trend for livestock managers to boost their
productivity and animal comfort.